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In recent years, state and national leaders in the United 
States, backed by public opinion, have taken the view 
that public schools frequently fail to give students a good 
education, causing a myriad of bad outcomes and, in the 
view of some, even putting the national welfare in jeop- 
ardy (Madaus, Raczek, & Clarke, 1997). This perceived 
problem has resulted in policies to hold educators 
accountable for imparting higher academic standards, as 
measured by high-stakes, standardized tests (Elmore, 
Abelmann, & Fuhrman, 1996). These reforms reached a 
crescendo with the passage of the federal No Child Left 
Behind Act of 2001 that required each state to have stan- 
dards on which students would be tested, with a 
demanding and intricate accountability system accompa- 
nying it (Anderson, 2005). Partly because nations seek to 
be competitive in an increasingly global marketplace, 
accountability also has been a frequent part of education 
reform worldwide (Levin, 1998; Tatto, 2006). 



However, policies have the potential to induce costs 
as well as benefits (Darling-Hammond & Wise, 1985), 
and research has repeatedly shown that education reform 
policies may have unintended effects, or what Marshall 
(1998) terms “slippage between policy intent and out- 
comes” (p. 103). Linn (2000) argues that unintended 
negative effects often overshadow positive ones when 
accountability systems have high-stakes testing and calls 
for evaluation of such negative outcomes. 

One hypothesized outcome of education accounta- 
bility policies is pressure that contributes to teacher 
burnout (Farber, 1991). Schools with a strong drive 
toward measurable academic goals tend to have more 
burned-out teachers (Friedman, 1991). Studies also 
point to accountability as increasing stress for teachers 
(McNeil, 2000; Smith, 1991). 

Increases in burnout, of course, could be related to 
teachers leaving the profession (Goddard & Goddard, 
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2006), a critical issue given that increasing enrollments 
and decreasing class sizes, among other factors, have 
made it difficult for school systems to attract and retain 
qualified teachers (Murnane & Steele, 2007). 
Furthermore, teachers are more likely to leave low-per- 
forming and minority districts (Guarino, Santibanez, & 
Daley, 2006), and there is evidence that accountability 
policies exacerbate the problem (Clotfelter, Ladd, Vigdor, 
& Diaz, 2004). 

Thus in the study reported here, we explore relation- 
ships between burnout and teachers’ perceptions of one 
state’s accountability policy — principally its requirement 
that student academic achievement improve as demon- 
strated by results on a high-stakes test. Burnout is a con- 
struct with multiple definitions, but it is generally char- 
acterized by professionals feeling that their efforts are 
ineffective, the tasks never ending, and payoffs not forth- 
coming (Farber, 1991). Empirical evidence from the 
most widely used measure of burnout points to three 
dimensions: emotional exhaustion, or feelings of being 
emotionally drained; depersonalization, or callous treat- 
ment of those one is serving; and reduced personal 
accomplishment, or a decline in one’s feelings of achieve- 
ment (Maslach, Jackson, & Leiter, 1996). Stressors that 
teachers experience often lead to emotional exhaustion 
(Byrne, 1994; Leiter, 1991), which in turn can lead to 
depersonalization (Leiter, 1993). Personal accomplish- 
ment has weaker relationships to the other two burnout 
variables (Maslach et ah), but Byrne demonstrated that 
emotional exhaustion and depersonalization negatively 
relate to personal accomplishment for elementary teach- 
ers. 

Potential Links Between Policy and Burnout 

Numerous variables could potentially mediate the rela- 
tionship between accountability and burnout. Two stand 
out conceptually and empirically: role conflict and 
teacher self-efficacy, which is often referred to as teacher 
efficacy. Role conflict is the occurrence of two or more 
requirements or pressures that are judged incompatible 
(Rizzo, House, & Lirtzman, 1970). Role conflicts include 
dilemmas such as trying to balance quantity of material 
to be covered with quality of the work (Byrne, 1994). 
Teacher efficacy is a cognitive construct that has devel- 
oped out of Bandura’s (1977) work (Tschannen-Moran, 
Woolfolk Hoy, & Hoy, 1998). The concept is difficult to 
define and measure, but a general definition is “a 
teacher’s belief or conviction that he or she can influence 
how well students learn, even those who may be difficult 
or unmotivated” (Guskey, 1987, p. 41). 


Role conflict has demonstrated relationships with 
burnout (Byrne, 1994; Ginsberg, Cohn, Williams, 
Pritchett, & Smith, 1989; Schwab & Iwanicki, 1982). 
Results are inconsistent, with role conflict sometimes 
related to emotional exhaustion and depersonalization 
(e.g., Schwab & Iwanicki, 1982) and other times only to 
emotional exhaustion (e.g., Byrne). Likewise, role con- 
flict also has emerged as a consequence of reforms 
increasing academic standards. Teachers in the United 
Kingdom required to follow a national curriculum and 
log children’s progress felt they had to choose between 
what was to be tested and what they thought important 
(Black, 1994) and between the values of pursuing aca- 
demic achievement and of social and emotional develop- 
ment (Dunham, 1992). In the United States, teachers 
modifying their reading and language arts programs were 
chronically nervous that standardized test scores might 
decrease as a result, although they did not (S. D. Miller, 
1995). 

Studies demonstrate that higher teacher efficacy is 
negatively related to job burnout (Brissie, Hoover- 
Dempsey, & Bassler, 1988; Chwalisz, Altmaier, & 
Russell, 1992). Higher levels of teacher efficacy also are 
strongly related to higher commitment to the teaching 
profession (Coladarci, 1992). Meanwhile, policy analysts 
have found that accountability policies put teachers in a 
position in which they do not feel efficacious. That is, 
many educators do not believe they can influence stu- 
dent learning as demanded by accountability systems 
(Abelmann, Elmore, Even, Kenyon, & Marshall, 1999; 
Conley & Goldman, 1995). 

Another variable, teaching in a low-achieving 
school, is a potential antecedent that might lead teachers 
to experience reduced self-efficacy and additional pres- 
sure from accountability policy, although the evidence is 
less direct. Educators tend to have lower efficacy with 
nonacademic track classes than with academic and hon- 
ors classes (Raudenbush, Rowan, & Cheong, 1992) and 
to feel less able to influence how well low-ability stu- 
dents learn (Cooper, Burger, & Seymour, 1979). As for 
relationships with accountability policies, teachers who 
work with low-achieving students report having less 
faith that administrators would interpret test scores accu- 
rately (Urdan & Paris, 1994). Hence it may be that teach- 
ing in a low-achieving school drives the kind of pressure 
Lipsky (1980) describes for “street-level bureaucrats” to 
exonerate themselves of responsibility for outcomes. 

Based on this empirical evidence, we developed a 
path model linking pressure from accountability policy 
to teacher burnout via the mediators of role conflict and 
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reduced self-efficacy. The model also had lower existing 
school test scores as an antecedent driving more policy 
pressure and reducing self-efficacy although this was a 
tentative hypothesis due to more limited evidence. The 
model is depicted in Figure 1 . 

Given the exploratory nature of the research ques- 
tion, we wanted teachers to tell us about the effect of 
accountability policies in ways beyond Likert scale rat- 
ings. Thus the study included an interview so that teach- 
ers could explain the impact of policies on them. 



Figure 1 . Model of proposed relationships among vari- 
ables. 

Context 

Our study took place in South Carolina, where standard- 
ized test scores traditionally have been among the lowest 
in the United States. The focus of our study had its ori- 
gins in the Education Accountability Act of 1998, which 
resulted in national praise for its high standards 
(“Standards and Accountability,” 2001). It set academic 
standards for each grade and standards-based examina- 
tions in grades 3-8 and 10, the last of which serves as the 
exit exam. The test, called the Palmetto Achievement 
Challenge Test (PACT), included multiple-choice and 
open-response items asking students how they arrived at 
answers. For individual students, the law attempted to 
eliminate the practice of “social promotions,” with par- 
ents of those not performing to grade levels required to 
come in for a conference to establish means to improve 
outcomes. Further poor performance could result in a 
student’s being placed on academic probation or 
retained. At the aggregate level, the state department of 
education was to provide report cards to schools and dis- 
tricts on their performance — both by noting the percent- 
age of students performing well or not in a given year 
and by measuring overall improvement from year to year 
by tracking students. Schools that exceeded expected 
results were to receive recognition and incentives. 
Schools doing poorly were to devise an improvement 


plan, with the state providing assistance to help with 
weaknesses. 

Method 

Setting and Participants 

The study took place in spring 2000 in a part urban and 
part suburban school district (we use the pseudonym 
“Springfield”) with teachers of grades 3-5 (students take 
the PACT beginning in 3rd grade). Slightly more than a 
third of the Springfield’s students in grades 3, 4, and 5 
received free or reduced-price lunch at the time of the 
study, whereas the state average for those grades was 
slightly above 50 percent. The district’s 11 elementary 
schools had PACT scores in the initial year of testing 
ranging from average to well above average. 

We approached principals at nine of the district’s ele- 
mentary schools with the highest and lowest PACT 
scores for permission to have their teachers participate, 
and each allowed teachers to take part. Most principals 
allowed the first author to meet with teachers and 
explain the study, although two principals told their 
teachers about the study themselves using an informa- 
tion sheet provided. 

A total of 100 teachers — 76 percent of all possible 
participants — completed the questionnaires, although 
one was dropped because she was a special education 
teacher, not a regular classroom teacher. The main vari- 
able separating participating teachers from nonpartici- 
pants appeared to be hearing about the study in person 
from the first author, as 61 percent of the nonparticipants 
were informed via their principal or electronic mail. We 
paid teachers $10 for completing the questionnaires, and 
we donated $5 to their schools for each participating 
teacher; all payments were from the first author. 

The teachers had a mean of 13. 1 years’ experience in 
the classroom (median of 13.0), with a range of 1 to 32 
years of teaching. They had a mean of 8.2 years’ experi- 
ence in the Springfield District and 6.1 years at their cur- 
rent school. Nearly two-thirds — 64 percent — had mas- 
ter’s degrees. The mean age was 39.3 years. Ninety were 
female. Participants were almost evenly split on grade 
taught: 35 3rd grade, 31 4th grade, and 33 5th grade 
teachers. 

For the interview portion of the study, the goal was 
to select one of every five teachers completing question- 
naires. Using a median split, we categorized teachers as 
high- or low-burnout based on scores from an emotion- 
al exhaustion subscale described below. We then 
grouped teachers in each category by school and ran- 
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domly selected as proportionately as possible from each 
school. 

Of the 20 teachers selected, 14 — 70 percent — 
accepted. For each teacher who declined, we asked a 
randomly selected teacher from the same school (and the 
same side of the median split) to participate, resulting in 
nine teachers being contacted to fill the six open slots. 
We paid teachers $10 for participating in the interview 
phase of the study, with a donation of $5 again going to 
their school; all payments were from the first author. 

The teachers interviewed were virtually the same 
demographically as the entire sample. One major differ- 
ence was that 5th grade teachers were overrepresented 
(50 percent) and 4th grade ones underrepresented (20 
percent). The interviewer was blind to the burnout score 
of teachers. 

Measures 

Teachers completed a series of questionnaires about their 
views of accountability policies and their work (e.g., self- 
efficacy and burnout). Several measures were not used 
for analyses in this study and thus are not reported here. 

Policy Questionnaire 

Because of the lack of a quantitative measure on teacher 
attitudes toward accountability policies, we devised a 24- 
item instrument, the Accountability Policies Stressors 
and Supports Questionnaire (see Appendix A) that asks 
teachers about perceived stressors and support resulting 
from policies. We pilot-tested the questions with teach- 
ers at a school in a different district than the one studied. 
We utilized data from the measure in testing the causal 
model; as will be reported below, we ultimately reduced 
it to a three-item measure, policy support, with an alpha 
internal consistency coefficient of .80. 

Other measures 

We also utilized other measures: 

Teacher efficacy. We used a shortened version (see 
Tschannen-Moran et al., 1998) of the Teacher Efficacy 
Scale (Gibson & Dembo, 1984) as a measure of two fac- 
tors: general teaching efficacy and personal teaching effi- 
cacy. Personal teaching efficacy reflects teachers’ per- 
ceived ability to effect positive student change. General 
teaching efficacy has been interpreted in different man- 
ners, but it may be considered teachers’ views of the 
impact of constraints outside the classroom (e.g., home 
conditions) that can negatively affect teaching 
(Tschannen-Moran et al.). 


The five-item general efficacy scale has an alpha 
internal consistency coefficient of .72, whereas the five- 
item personal efficacy scale has an alpha of .77 (Hoy & 
Woolfolk, 1993). For the current sample, the alpha coef- 
ficients were .75 for the general measure and .74 for the 
personal efficacy measure. 

Role conflict. We used a shortened version of the Role 
Questionnaire of Rizzo and colleagues (1970) to meas- 
ure “intrarole intersender” conflicts in which individuals 
perceive work tasks as beyond their resources or unnec- 
essary (Schwab, Iwanicki, & Pierson, 1983). This four- 
item subscale was validated with a sample of teachers 
and had an alpha of .78. With the current sample, the 
alpha was .82. 

Teacher burnout. We used the 22-item Maslach 
Burnout Inventory for Educators (Maslach et al., 1996) 
to measure the three subscales of burnout. The first sub- 
scale, emotional exhaustion, has an alpha internal con- 
sistency of .90; the second, depersonalization, has an 
alpha of .76; and the third, personal accomplishment, 
has an alpha of .76 (Iwanicki & Schwab, 1981). For the 
current sample, the alpha coefficients were .90 (emotion- 
al exhaustion), .70 (depersonalization), and .79 (person- 
al accomplishment). 

Semi-structured interview. Following principles delin- 
eated by Patton (1990), we developed an interview ask- 
ing about the impact of accountability policies on teach- 
ers’ work. After pilot tests for clarity and completeness, 
we added a few questions and reworded a few, resulting 
in a semistructured interview of 22 questions. We asked 
open-ended questions about what teachers thought leg- 
islators wanted them to do, followed by questions on the 
impact of accountability policies, including whether 
teachers viewed policies as supportive or stressful; 
whether they saw relationships between policies and 
their efficacy and role conflicts; and whether policies 
affected their relationships with administrators, col- 
leagues, or students. 

Procedure 

We completed quantitative measures a few weeks before 
PACT testing began. Interviews, which ranged between 
45 and 90 minutes, took place at school, either in the 
weeks before, during, or after PACT testing. We tape- 
recorded and transcribed them verbatim. 

Results 

We report quantitative results first, with interview results 
following. 
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Quantitative Results 

We report quantitative results with (1) factor analysis of 
the policy stress and support questionnaire; (2) descrip- 
tive statistics; and (3) tests of the model. 

Factor Analysis 

We undertook a factor analysis of the policies stress and 
support questionnaire to create the policy pressure vari- 
able for the model. Maximum likelihood extraction 
methods, followed by an orthogonal rotation with 
Varimax procedures, resulted in four distinct factors of 
policy outcomes: support for efforts to educate students, 
general support from sources such as the state and par- 
ents, pressure from supervisors and parents, and a lack 
of fairness of teacher evaluations. 

We undertook a confirmatory factor analysis to 
determine if the four factors represented manifest vari- 
ables for the latent construct of policy pressure, but the 
model provided inadequate fit (e.g., Tucker-Lewis index 
= .84). Thus we selected one of the factors, the three- 
item help in educating students, for the model because 
its items reflected teachers’ views of accountability poli- 
cies helping them educate students, a construct closer to 
the core of the policies’ impact on their daily work than 
the other factors. The factor consisted of items saying 
that policies to increase student academic performance 
“help me more effectively impart knowledge,” “overall 
provide more support for me in my efforts to educate 
students,” and “make me feel the state is helping me 
improve students’ academic performance.” 

Descriptive Statistics 

In measures used in the model (see Table 1), policy sup- 
port was moderately low (M = 7.7 on a scale ranging 
from 3 to 18, with higher numbers indicating more sup- 
port). Personal self-efficacy was relatively high, whereas 
general self-efficacy was somewhat low. Role conflict 
occurred at moderate levels. As for burnout, emotional 
exhaustion was relatively elevated (M = 26.5), whereas 
depersonalization was low (M = 5.6) and personal 
accomplishment relatively high (M = 40.7). 

Tests of the Path Model 

The depersonalization and personal accomplishment 
subscales had high degrees of skewness, and thus we 
undertook square root transformations, resulting in rela- 
tively normal distributions. Because the transformation 
reflected the personal accomplishment scale, resulting in 
a reversal of direction of interpretation, the variable is 
called decreased personal accomplishment. 


The personal efficacy measure had a severely restrict- 
ed range, and one item (“If one of my students couldn’t 
do a class assignment, I would be able to accurately 
assess whether the assignment was at the correct level of 
difficulty”) had very low correlations with the four oth- 
ers (“When I really try, I can get through to most difficult 
students;” “If a student did not remember information I 
gave in a previous lesson, I would know how to increase 
his or her retention in the next lesson;” “If a student in 
my class becomes disruptive and noisy, I feel assured that 
I know some techniques to redirect him or her 
quickly;”and “If I try really hard, I can get through to 
even the most difficult or unmotivated students”). 
Furthermore, in the interviews almost all teachers point- 
ed to influences other than themselves (particularly 
home and family factors used in the general efficacy 
scale) as influences on student academic outcomes. 
Therefore, we utilized the general self-efficacy measure in 
the path model. 


Table 1. Means, Standard Deviation, and Scale 
Ranges for Model Measures 

Measure 

Mean 

Standard 

Deviation 

Range 

Policy support 

7.7 

3.4 

3-18 

Personal self-efficacy 

25.2 

3.3 

5-30 

General self-efficacy 

13.4 

4.7 

5-30 

Role conflict 

18.3 

6.2 

4-28 

Emotional exhaustion 

26.5 

11.5 

0-54 

Depersonalization 

5.6 

5.2 

0-30 

Personal accomplishment 

40.7 

6.1 

0-48 


As a final step before testing the model, we used SAS 
(statistical analysis software) (version 8.0) to examine 
correlations between school PACT scores, within-school 
grade-level PACT scores, self-efficacy, and policy sup- 
port. State PACT scores and grade PACT scores were 
highly correlated (r = .94, p < .0001). Flowever, neither 
school nor grade PACT scores were significantly associat- 
ed with self-efficacy (r = -.07, r = -.02, respectively) or 
policy pressure (r = -.16,r = -.15, respectively). We also 
tested these relationships using multilevel analytic tech- 
niques, with teachers (level 1) nested within grades (level 
2) within schools (level 3). Using MLwiN (version 1.1), 
we modeled the impact of state and grade PACT scores 


5 


The Effects of Education Accountability on Teachers: Are Policies Too Stress-Provoking for Their Own Good? 


(due to their high correlation, we included them in sep- 
arate models) on self-efficacy and policy pressure, 
respectively. We found no significant relationships and, 
therefore, did not utilize test scores in the model. 

We analyzed the hypothesized path model using 
maximum likelihood estimation. Results with standard- 
ized regression coefficients are presented in Figure 2. All 
paths in the model were significant, indicating no need 
for model trimming or building. The model displayed 
excellent goodness of fit by both absolute and compara- 
tive measures (see Table 2). Path coefficients ranged from 
.26 to .64, indicating generally strong relationships 
among model variables. 


Interview Results 

The first author analyzed transcripts from the 20 inter- 
views using anthropological methods centering on creat- 



ing taxonomic analyses and searching for cultural themes 
(Spradley, 1980. Then the first author wrote codes for 


Table 2. Goodness-of-Fit Indices for the Original and 
Modified Path-Analytic Model 

Fit Index 

Model 

Statistic 

Generalized Likelihood Ratio (chi-square) 

4.80 a 

Normed Fit Index 

.97 

Adjusted Goodness of Fit 

.96 

Tucker-Lewis Index 

1.0 

Standardized Root Mean Squared Residual 

.04 

a df = 8 (p = .60) 


sections of text and subsequently refined them and 
checked them for accuracy with iterative passes through 
the data. The first author next created taxonomies for 
most questions, and reviewed them for similar themes, 
resulting in a final taxonomy outlining constructs, 
including multiple types of teacher stress, related to 
accountability policies. Finally, the first author created a 
separate taxonomy of teacher suggestions for changing 
policy to improve student academic outcomes. 

Figure 2. Path-Analytic Model with Standardized 
Regression Coefficients (All Significant) 
Stressors 

Limits on teacher self-efficacy. All teachers interviewed 
made at least one reference to their influence on student 
academic performance being limited. Efficacy for 
improving academic achievement can be conceptualized 
as stemming from teachers’ personal factors, such as hav- 
ing the ability to help students, as well as limited by sit- 
uational factors, such as pupils’ home life (see Gist & 
Mitchell, 1992). Using that rubric, the results are strik- 
ing. Virtually no teachers noted any lack of ability on 
their part as limiting their efficacy. Flowever, 80 percent 
of teachers made references to home or family factors as 
having an influence on student achievement, noting 
factors such as parental involvement, socioeconomic 
status, stressors children experience, and exposure to 
opportunities to learn outside of school. One 4th grade 
teacher said this about home factors: 

... I feel in my experience that there’s a direct cor- 
relation between children who are spoken with at home 
and children who can write. Those who are spoken to 
at home can write good stories. Those who are never 
spoken to can’t write anything. Or it’s much harder for 
them. ... I think that that puts too much pressure on me, 
you know? I can only improve as much as, you know, the 
hopes are. I can only do so much. 

Many teachers — 40 percent — also felt pressured by 
student deficits or problems that could limit their aca- 
demic abilities. These responses typically included refer- 
ences to learning disabilities, low intelligence quotients, 
or other deficits. One 3rd grade teacher said: 

And once again, I’ve had certain students that have 
tested like borderline, they’re at 71 in their IQ. Well, 
they’re being tested. Now if you know you have a 71 IQ 
student, and I taught him all these standards, and I 
prepped him for PACT, do I think he’s going to do well 
on PACT? No. And I don’t think it’s because I taught him. 
I think he’s going to do what he’s capable of. 

Limited time and too many tasks. Eighty percent of 
teachers reported having either limited time to complete 
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tasks related to accountability policies or having so many 
accountability-related tasks that it was difficult to com- 
plete them. Half of the teachers discussed the lack of 
time as affecting their ability to integrate other goals. 
These included academic goals — such as working with 
all students sufficiently teaching all subjects thoroughly 
or making academic work engaging for students — as 
well as nonacademic ones — such as helping students 
develop prosocial behaviors. Thirty percent of teachers 
said education objectives and standards were so numer- 
ous that it was difficult to cover them all, particularly by 
the time standardized tests were administered. As one 
5th grade teacher said: 

Well, we’re mandated to teach certain things by the 
district and then the state has standards. Umm, and then 
they’re testing on those things, so and it’s a lot to cover 
and cover it well, and it’s just like hurry, hurry, hurry. 
And it’s not like you’re working in a factory here with lit- 
tle cookie-cutter things that just, you’ve got to have just 
some time to reach the children that maybe need a little 
bit more help and it’s so fast-paced sometimes to get 
everything in, that some of the children get left behind 
and that’s what bothers me. 

Changes in teaching practices. In a related theme, two- 
thirds of teachers discussed accountability policies 
changing either the manner in which they taught or the 
emphasis on the subjects they taught, often as a result of 
the time pressures delineated above. The most common 
change in teaching, mentioned by 35 percent of teachers, 
was no longer emphasizing that all students master prin- 
ciples before moving to the next topic. Other changes 
included de-emphasizing group work (15 percent) and 
making learning more teacher-directed, as in having 
more lectures (20 percent). A 3rd grade teacher, noting a 
day when it had snowed, said the state’s accountability 
policy: 

makes us, for better or worse, umm, stick more to 
those plans that we have in those books, which before 
we always said the plans aren’t written in stone, if you see 
a teachable moment, grab it. And we’re not grabbing 
those teachable moments anymore. I mean, if something 
were to happen like when we had snow, instead of doing 
something fun and we could have really had a great les- 
son on snow and what makes it snow, that wasn’t in our 
standards, that’s not in our curriculum, we don’t do that 
anymore. 

As for changes in teaching subjects, 20 percent of 
teachers discussed emphasizing or de-emphasizing sub- 
ject material. One 5th grade teacher decided to focus 
almost exclusively on the PACT subjects — English and 


math — six weeks before the test after fellow teachers said 
they had virtually dropped non-PACT subjects: “It’s 
almost if you talk to other people in the school, people 
will say, ‘Oh, you’re still teaching social studies? Are you 
still teaching science?’ So, it’s kind of, it’s almost school- 
wide that people have quit. ...” 

Concerns about the fairness of standardized test scores. 
Eighty-five percent of teachers had concerns about the 
ways they were evaluated based on test scores. Their rea- 
sons varied widely and included concerns that the PACT 
only indirectly measured their work because students 
took the test. The use of the scores was a large concern, 
with 40 percent believing the results were overempha- 
sized. As one 3rd grade teacher stated, “I think my prob- 
lem is really not so much with standards as it is the use 
of a state-created test as the one determining factor of 
whether those standards have been met or not.” 

Teacher Suggestions 

Teachers made a variety of suggestions for changes that 
would improve academic outcomes of students, usually 
given in response to a question about what the teacher 
would do if given the power to change policy. The sug- 
gestions are reported in four categories. 

Changes to increase fairness. Nearly two-thirds — 65 
percent — of teachers made at least one suggestion to 
enhance the fairness of the assessment of teachers and 
students, including measuring student progress with 
pre- and post-tests and having a variety of assessments 
instead of one test. 

More resources or fewer demands. Nearly two-thirds — 
65 percent — of teachers suggested reducing the 
demands on them or giving them more resources, 
including reducing class size, hiring teacher aides, or 
obtaining more updated materials or equipment. 

Changes in teacher-legislator relations. Fifty-five per- 
cent of teachers suggested changes in interactions with 
legislators. Nearly half advocated for lawmakers come to 
their classrooms to gain better understanding of teachers’ 
jobs and the education process. Others called for legisla- 
tors to involve them in the policymaking process or 
praise them more for their efforts. 

Other suggestions. More than a third — 35 percent — of 
teachers made suggestions about increasing the profes- 
sionalism of teaching. The most popular suggestion was 
an increase in salary. Fifteen percent of teachers made 
suggestions that might be considered broad-scale 
changes, such as a more even distribution of school 
funding statewide or smaller school sizes. 
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Discussion 

This study set out to discover whether elementary school 
teachers perceived accountability policies as having 
unintended consequences for their well-being. The 
answer was yes: teachers reported a myriad of negative 
policy consequences, a finding consistent with other 
investigations (e.g., Abrams, Pedulla, & Madaus, 2003; 
Barksdale -Ladd & Thomas, 2000; M. G. Jones, Jones, 
Hardin, Chapman, Yarbrough, & Davis, 1999; B. D. 
Jones & Egley, 2004, 2007; Webb, 2006). A lack of pol- 
icy support was associated with teachers’ emotional 
exhaustion via the mediators of role conflict and low self- 
efficacy. 

The interviews were consistent with the quantitative 
results. Teachers discussed numerous types of role con- 
flicts, often driven by the lack of time to impart all edu- 
cation standards. Racing to teach all standards, teachers 
often altered their teaching practices and had less time to 
pursue other goals. Similar dynamics have been docu- 
mented in other states with accountability policies 
(Barksdale-Ladd & Thomas, 2000; Lasky, 2005; McNeil, 
2000; Smith, 1991; Valli & Buese, 2007). 

In terms of self-efficacy, teachers had few concerns 
about personal factors, such as their own pedagogical 
capabilities, but often believed that situational, student- 
related factors (e.g., home conditions) played a role. In 
the context of accountability policies, efficacy has a new 
twist, as task attainment is evaluated not by teachers’ 
direct performance but rather indirectly via their stu- 
dents’ test scores. Teachers’ lack of efficacy may represent 
their relieving stress by absolving themselves from 
responsibility for student outcomes, a typical outcome 
among those who implement policies (Lipsky, 1980). 
Teachers also may be at least partly on the mark in their 
assessment of having limited efficacy, as school outcomes 
on standardized tests are highly influenced by student 
socioeconomic status (Powers, 2003). However, teach- 
ers’ pointing to student deficiencies raises the question of 
whether accountability policies lead teachers to focus on 
deficits rather than overcoming them. 

One factor that might plausibly have been related to 
teachers’ views of accountability policies and their 
burnout levels, the school PACT scores, showed no rela- 
tionship. It is likely that there was not enough variance 
in school PACT scores for differences to emerge. 
Alternatively, teachers who instruct different types of stu- 
dents may be under relatively equal pressure. Grant 
(2000) found that teachers at varying types of schools all 
experienced uncertainty over whether test scores would 


increase and that, if anything, suburban teachers were 
more concerned than urban counterparts, perhaps to 
maintain an existing high status. 

Limitations and Alternative Explanations 

This study has limitations, including use of a new meas- 
ure of policy support and reliance on teacher report 
alone. Alternative explanations for the results exist. For 
example, policy-related stress may have been less salient 
in this study had it taken place in the earlier stages of the 
school year. 

It also is possible that the variables interact in ways 
other than that prescribed in the model. An alternative 
explanation to the model’s relationships is that teachers 
already experiencing some burnout had negative reac- 
tions to South Carolina’s new accountability system. 
Even if this is so, most teachers nonetheless find the pol- 
icy unsupportive and therefore its chances for reaching 
the goal of enhanced student learning remain dimin- 
ished. 

The study also involves teachers from one locale and 
does not examine the multiple levels of policy, which in 
the United States can pass from the federal government 
to states to districts to schools and finally to teachers 
themselves. Context is likely to play a strong role in the 
impact of policies on teachers, because their responses to 
accountability are affected by factors such as principal 
support (Kelley & Finnigan, 2003) and district adminis- 
trators (Louis, Febey, & Schroeder, 2005). 

A final limitation is that the data examined in this 
study were collected in 2000, and teachers’ views of 
accountability may have changed either with time, 
changes in policy, or both. However, data were collected 
after the passage of South Carolina’s law that established 
the accountability framework lasting through 2008. 
Replications of this study would be helpful to determine 
whether teachers’ views are changing. 

Implications for Policy 

Given the limitations of the study, we do not suggest that 
policy be reformulated solely based on its results. 
However, the possibility of accountability policies result- 
ing in diminished job engagement has implications that 
policymakers should consider. Principally, if teachers 
become emotionally exhausted, the long-term outcome 
may be leaving the profession or decreased commitment 
and enthusiasm (Leiter & Maslach, 1998). Either result 
is unlikely to lead to what accountability policies seek: 
better-prepared students. 
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What can policymakers do to promote teachers 
being engaged to lead students to better academic out- 
comes without pressures that may push them toward 
burnout? One possibility is undertaking stress-reduction 
interventions with individual teachers. However, these 
strategies are unlikely to appeal to policymakers because 
they often encourage less work or lowered expecta- 
tions — and may run counter to the hope of increasing 
student academic achievement. Furthermore, burnout 
results from both individual and contextual factors 
(Kelchtermans & Strittmatter, 1999). Making changes in 
individuals when the system is part of the problem leaves 
basic structures intact and is unlikely to affect the prob- 
lem (Seidman, 1988). Therefore, policymakers should 
consider making changes for teachers rather than in 
teachers. 

Teachers in this study suggested such changes, and 
we believe policymakers would be wise to pay attention 
to them. One suggestion was having more opportunities 
to interact with legislators about education and policy. 
Teachers may take the initiative to contact legislators or 
have their opinions heard through their union or other 
means, but their voices are often not strongly present in 
debates over policies or reforms (M. G. Jones et al., 1999; 
S. D. Miller, Hayes, & Atkinson, 1997; Smith & Fey, 
2000). If state officials don’t provide an invitation and 
process in which teachers can provide input, they risk 
mistrusting the people they most need to make their 
policies work (for an example of this process in action, 
see S. D. Miller et ah, 1997). Giving teachers a means to 
have more policy input also makes policies more consis- 
tent. The policy establishing new standards and tests in 
South Carolina strongly emphasizes that students 
employ problem solving and critical thinking. However, 
the state has not given teachers the opportunity to 
engage in those types of processes to formulate or 
improve policies (similar inconsistencies are discussed in 
Noble & Smith, 1994). 

Teachers’ other suggestions in this study included 
more professionalism in teaching. We believe an impor- 
tant step in that direction would be more widespread 
adoption of a professional development model, a step 
recommended for accountability policies (Sirotnik & 
Kimball, 1999). We define professional development as 
Lieberman and L. Miller (1990) do: a dramatic alteration 
of time and resources that creates schools of profession- 
al practice in which there are opportunities for activities 
such as teacher study groups to discuss areas of profes- 
sional interest and peer observations to critique strengths 
and weaknesses. This represents an investment in 


improving the quality of teaching, a variable that has 
been more effective in the improvement of academic 
achievement than implementation of student testing 
(Darling-Hammond, 2000, 2004). Furthermore, there is 
evidence that professional development schools can 
serve as buffers against burnout (L. Miller, 1999) and 
that schools incorporating aspects of professional devel- 
opment result in teachers having increased efficacy 
(Chester & Beaudin, 1996; Henson, 2001). 

A related trend toward professionalism in teaching 
has been the practice of teacher leadership (Katzenmeyer 
& Moller, 2001). A broad construct, teacher leadership 
involves teachers taking lead roles in tasks traditionally 
given to administrators, giving them influence and deci- 
sion-making power in areas such as school management, 
curriculum, school improvement, and community 
involvement (York-Barr & Duke, 2004). There are 
numerous examples of teachers taking leadership roles in 
the United States and other nations (e.g., Crowther, 
Kaagan, Ferguson, & Hann, 2002; Frost, Durrant, Head, 
& Holden, 2000; Katzenmeyer & Moller; Lieberman & 
L. Miller, 2004; Muijs & Harris, 2006). While there is a 
need to know more about the effects of teacher leader- 
ship (York-Barr & Duke, 2004), evidence indicates that 
teachers sharing leadership with principals can have a 
positive impact on student achievement (Marks & Printy, 
2003; Printy & Marks, 2006). Teacher leadership thus 
offers a potential means of increasing efficacy and lower- 
ing burnout. 

Another suggestion teachers made in this study was 
to enhance the fairness of assessments of teachers and 
students. This issue is receiving increasing attention, as 
there is empirical evidence that, for example, the No 
Child Left Behind’s measurement of academic progress 
lacks validity (Balfanz, Legters, West, & Weber, 2007). 
The federal government is granting some flexibility to 
states to change measurement rubrics, including using 
learning gains (Peterson & West, 2006). South Carolina 
is not among those states, although the state’s accounta- 
bility law tracks student progress. Another means to 
increase fairness would be to replace a single high-stakes 
test with a variety of measures, a step that Linn (2000) 
views as essential. Because more evaluations could 
increase stress on teachers, we advocate following Porter 
and Chester’s (2002) recommendation that multiple 
measures be employed with the philosophy of promot- 
ing more opportunities for students to succeed — a plan 
that has been undertaken in schools districts such as 
Philadelphia’s (Porter, Chester, & Schlesinger, 2004). 
Creating what Porter and Chester call “a bias in favor of 
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success” (p. 290) could ease pressure and increase sup- 
port for teachers and students. If multiple measures are 
impractical, districts could invest in software that almost 
instantaneously assesses student progress in areas like 
math. Yeh (2006) found that teachers using such soft- 
ware were able to efficiently assess student weaknesses 
and counteract them, potentially reducing the stress of 
high-stakes testing. 

As a final consideration, teachers in this study often 
focused on the outside forces in students’ lives that may 
limit their academic outcomes. It is our hope that teach- 
ers not overemphasize these forces, but they are correct 
in seeing that factors beyond individual students have an 
impact in school. 

One means for schools to have an effect on “outside 
forces” is through fostering greater family and communi- 
ty involvement in schools, which the No Child Left 
Behind Act requires (Epstein, 2005). Although a variety 
of factors affect parental involvement, it is influenced by 
schools, with teachers often playing important roles 
(Hoover-Dempsey et al., 2005). Strongly implemented 
partnerships of schools, families, and communities can 
lead to better outcomes for students, such as better atten- 
dance (Sheldon, 2007). Hence, teachers who are sup- 
ported in involving parents may move away from the 
assumption that there are factors influencing student 
achievement that they cannot change. 

However, we hope that policymakers who want 
improved academic outcomes will not stop at promoting 
school-community relations but also will consider 
addressing social and economic disparities to help all 
students come to school without disadvantages (Taylor, 
1994). This may be idealistic, but without such changes, 
what teachers are asked to do under accountability poli- 
cies is akin to asking physicians to return all patients, 
regardless of their initial condition, to optimal health. 
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Appendix A: Accountability Policies Stressors and Supports Questionnaire 


For me, policies to increase student academic performance: 

1 . Result in my getting more support from the state 

2. Result in my getting more support from parents 

3. Result in my getting more support from the school district 

4. Result in my getting more support from my supervisor 

5. Result in my getting more support from the public 

6. Result in more pressure on me from the public 

7. Result in more pressure on me from my supervisor 

8. Result in more pressure on me from the state 

9. Result in more pressure on me from parents 

10. Result in more pressure on me from my school district 

11. Make me feel like my job performance is being fairly evaluated by the public 

12. Make me feel like my job performance is being fairly evaluated by my supervisor 

13. Make me feel like my job performance is being fairly evaluated by lawmakers 

14. Make me feel like my job performance is being fairly evaluated by my school district 

15. Help me focus more on what I should be doing 

16. Make me choose between what I think children need and what the state says I should do 

17. Help me more effectively impart knowledge 

18. Result in standards that make it difficult to teach students who have different abilities 

19. Overall put more stress in my job 

20. Overall ease the tasks I have in teaching 

2 1 . Overall provide more support for me in my efforts to educate students 

22. Overall add a lot of burdens to my job 

23. Make me feel the state is helping me improve students’ academic performance 

24. Set standards that just are not reachable for all students 
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